Scalable Exploratory Data Mining of Distributed Geoscientific Data

نویسندگان

  • Eddie C. Shek
  • Richard R. Muntz
  • Edmond Mesrobian
  • Kenneth W. Ng
چکیده

Geoscience studies produce data from various observations , experiments, and simulations at an enormous rate. Exploratory data mining extracts \content information" from massive geoscientiic datasets to extract knowledge and provide a compact summary of the dataset. In this paper, we discuss how database query processing and distributed object management techniques can be used to facilitate geoscientiic data mining and analysis. Some special requirements of large scale geoscientiic data mining that are addressed include geoscientiic data modeling, parallel query processing, and heterogeneous distributed data access.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Overview of Data Mining Applications in Oil and Gas Exploration: Structural Geology and Reservoir Property-Issues

Low oil prices have motivated energy executives to look into cost reduction in their supply chains more seriously. To this end, a new technology that is experimentally considered in hydrocarbon exploration is data mining. There are two major categories of geoscientific problems in which data mining is applied: structural geology and reservoir property-issues. This research overviews these categ...

متن کامل

geoPOM: A Heterogeneous Geoscientific Persistent Object System

Lately, a need for uniform access to and integration of data stored in specialized, non-standard repositories such as GIS or multimedia storage servers has become apparent. In this paper, we provide an overview of a heterogeneous geoscientific persistent object manager (geoPOM) developed at the UCLA Data Mining Laboratory. GeoPOM provides users with the “illusion” of a single object-oriented sp...

متن کامل

Extending GeoToolKit to Access Distributed Spatial Data and Operations

The integration of extremely heterogeneous geoscientific data is a challenging problem. To provide a non-redundant maintenance and cooperative utilisation of data by different applications developed within the interdisciplinary geo-scientific research centre at the University of Bonn we elaborated a technology based on GeoToolKit a component-based software development tool. In the paper we pres...

متن کامل

A Parallel MapReduce Algorithm to Efficiently Support Itemset Mining on High Dimensional Data

In today’s world, large volumes of data are being continuously generated by many scientific applications, such as bioinformatics or networking. Since each monitored event is usually characterized by a variety of features, highdimensional datasets have been continuously generated. To extract value from these complex collections of data, different exploratory data mining algorithms can be used to...

متن کامل

Scalable and Visualization-oriented Clustering for Exploratory Spatial Analysis

Clustering can be applied to many fields including data mining, statistical data analysis, pattern recognition, image processing etc. In the past decade, a lot of efficient and effective new clustering algorithms have been proposed, in which famous algorithms contributed from the database community are CLARANS, BIRCH, DBSCAN, CURE, STING, CLIGUE and WaveCluster. All these algorithms try to chal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996